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Abstract 

A subset S of vertices of a graph G is called a k-path vertex cover if every 
path of order k in G contains at least one vertex from S. Denote by ipk{G) 
the minimum cardinality of a /c-path vertex cover in G. It is shown that the 
problem of determining ipk{G) is NP-hard for each k > 2, while for trees the 
problem can be solved in linear time. We investigate upper bounds on the 
value of ipkiG) and provide several estimations and exact values of ipk{G). 
We also prove that ip^iG) < {2n + m)/6, for every graph G with n vertices 
and m edges. 



1. Introduction and motivation 

In this paper we consider finite graphs without loops and multiple edges 
and use standard graph theory notations (see e.g. [25|). In particular, by 
the order of a path P we understand the number of vertices on P while the 
length of a path is the number of edges of P. 

We introduce a new graph invariant that generalizes the intensively stud- 
ied concept of vertex cover. Our research is motivated by the following prob- 
lem [2l[ related to secure communication in wireless sensor networks (shortly 
WSN). The topology of WSN can be represented by a graph, in which ver- 
tices represent sensor devices and edges represent communication channels 
between pairs of sensor devices. Traditional security techniques cannot be 
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applied directly to WSN, because sensor devices are limited in their computa- 
tion, energy, communication capabilities. Moreover, they are often deployed 
in accessible areas, where they can be rather easily captured by an attacker. 
In general, a standard sensor device is not considered as tamper-resistant 
and it is undesirable to make all devices of a sensor network tamper-proof 
due to increasing cost. Therefore, the design of WSN security protocols has 
become a challenge in security research. 

We focus on the Canvas scheme 0, @, 21, 22] which should provide data 
integrity in a sensor network. The ^-generalized Canvas scheme 21] guaran- 
tees data integrity under the assumption that at least one node which is not 
captured exists on each path of the length k — 1 in the communication graph. 
The scheme combines the properties of cryptographic primitives and the net- 
work topology. The model distinguishes between two kinds of sensor devices 
- protected and unprotected. The attacker is unable to copy secrets from a 
protected device. This property can be realized by making the protected 
device tamper-resistant or placing the protected device on a safe location, 
where capture is problematic. On the other hand, an unprotected device can 
be captured by the attacker, who can also copy secrets from the device and 
gain control over it. During the deployment and initialization of a sensor 
network, it should be ensured, that at least one protected node exists on 



each path of the length k — 1 in the communication graph [2l|. The problem 



to minimize the cost of the network by minimizing the number of protected 



vertices is formulated in 21 



Formally, let G be a graph and let A; be a positive integer. A subset of 
vertices S C V(G) is called a k-path vertex cover if every path of order k in 
G contains at least one vertex from S. We denote by ipk{G) the minimum 
cardinality of a fc-path vertex cover in G. 

Clearly, 2-path vertex cover corresponds to the well know vertex cover 
(a subset of vertices such that each edge of the graph is incident to at least 
one vertex of the set). Therefore ^2 (GO is equal to the size of the minimum 
vertex cover of a graph G. Moreover, the value oijjj^(G) corresponds to 
the so-called dissociation number of a graph 0, QlJ [23] defined as follows. 
A subset of vertices in a graph G is called a dissociation set if it induces a 
subgraph with maximum degree 1. The number of vertices in a maximum 
cardinality dissociation set in G is called the dissociation number of G and 
is denoted by diss(G). Clearly, ^ 3 (G) = \V(G)\ - diss{G). 

A related colouring is the k-path chromatic number, which is the minimum 
number of colours that are necessary for colouring the vertices of G in such 
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easy to see that ^ k {G) < Xfc( y \V(G)\. 



a way that each colour class forms a P fc -free set [H, 0, 0, 15, 2(|, and it is 

g(G)-] 



Since the minimum vertex cover problem is NP-hard [17j, it is not sur- 
prising that so is the problem of determining ip^ for each k > 3. We provide 
details in Section [2]- actually, we reduce the minimum vertex cover problem 
to the minimum fc-path vertex cover problem. 

However, since the question whether there is a /c-path vertex cover of size 
at most t can be expressed in the monadic second order logic, by famous 
Courcelle's theorem [5j, the minimum fc-path vertex cover problem can be 
solved in linear time on graphs with bounded treewidth, e.g. trees, series- 
parallel graphs, outerplanar graphs, etc. In Section [3] we determine the exact 
value of ipk for trees and present a linear time algorithm which returns an 
optimal solution for trees. Then Section @] is devoted to outerplanar graphs. 
We present a tight upper bound for ip 3 (G). 

In Section [5] we provide several estimations for the size of minimum k- 
path vertex cover on degree of its vertices. Finally, we prove that ips(G) < 
(2n + m)/6, for every graph G with n vertices and m edges. 

2. NP-completeness 

In this section we prove that the problem of determining ip^ is NP- 
complete for every k > 2. For k = 2 this equals to the vertex cover problem 



(shortly VCP) which is known to be NP-complete [17| . For an arbitrary fixed 
integer k > 2 the decision version of the fc-path vertex cover problem can be 
stated as follows: 

fc-Path Vertex Cover Problem (fc-PVCP) 
INSTANCE: Graph G and a positive integer t. 

QUESTION: Is there a fc-path vertex cover S for G of size at most tl 



Theorem 1. For any fixed integer k > 2 the k-Path Vertex Cover Problem 
is NP-complete. 

Proof. First, it is not hard to see that fc-PVCP is in NP. Note that there are 
at most 0(n h ) paths of length k — 1 in G. Thus for a given set 5* we can 
decide in polynomial time whether all paths of length k — 1 are secured by 
the vertices from S. 
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Note that 2-PVCP coincides with VCP. For k > 2 we will reduce VCP 
to fc-PVCP. Let G be an arbitrary graph. Let G' be a graph obtained from 
G by joining a path with |_^ir"J new vertices to every vertex of G (i.e. for 
k = 3 or k = 4 we attach a leaf to each vertex; for fc = 5orfc = 6a path 
on two vertices is attached to each vertex of G; etc.). We call the vertices of 
G original vertices, and others are called new. In order to finish the proof 
it suffices to show that the size of a minimum vertex cover in G is equal to 

MG')- 

Suppose that G' has a /c-path vertex cover S'. Suppose that S' contains 
also a new vertex x (i.e. it lies on an attached path). Let x' be an original 
vertex that has the smallest distance from x. Note that x secures only the 
paths that go through the vertex x', since the length of the attached path is 
less than k. Hence, if x is replaced in S' by x', all paths remain secured. By 
consecutive application of this procedure for each new vertex x we obtain a 
set S comprising only of the original vertices. Clearly, \S\ < \S'\, and by the 
procedure described above, S is also a fc-path vertex cover. We claim that 
S C V(G) forms a vertex cover in G. 

Suppose to the contrary that there is an edge xy in G whose endvertices 
x and y are not in S. Consider the path P in G', composed by the path, 
attached to x, vertices x and y, and the path attached to y. Then clearly 
P contains no vertex from S, and it has 2|_^-J + 2 > k vertices, which is a 
contradiction. Thus S is a vertex cover in G of size at most | *S'' | . 

Conversely, by a similar reasoning we can show that a vertex cover of G 
yields a fc-path vertex cover set in G' of the same size. □ 

The reduction described in the proof of Theorem [T] also shows that for 
any approximation rate r one can transform a polynomial time r-approxi- 
mation for the minimum fc-path vertex cover problem to a polynomial time 
r- approximation algorithm for the minimum vertex cover. Using a result of 
[zj], we immediately infer that it is NP-hard to approximate fc-PVCP, for 
k > 2, within a factor of 1.3606, unless P = NP. 

A well-known 2-approximation algorithm which repeatedly puts vertices 
of an edge into the constructed vertex cover and removes them from the 
graph, was discovered independently by F. Gavril and M. Yannakakis [4j]. A 
polynomial time approximation with a better constant factor is not known 
for the vertex cover problem. Similarly, for the minimum /c-path vertex 
cover problem we can construct an approximation algorithm in the following 
way: Find a path on k vertices, put its k vertices into the constructed set 
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and remove them from the graph. Since at least one of these k vertices 
belongs to an optimal solution, this provides a k- approximation algorithm 
for the minimum fc-path vertex cover with a polynomial running time while 
k is considered as a fixed constant. Note that the currently best known 
randomized algorithm for finding a path on k vertices runs in timeO(2 fc n°W), 
where n denotes the order of a graph (see [itSl]). 

3. Path vertex cover for trees 

We begin our investigation with the class of trees, that present an im- 
portant underlying communication topology for WSN. Courcelle's theorem 
[5] guarantees the existence of a linear time algorithm, basically using dy- 
namic programming. In this section, we describe such an algorithm in detail, 
and we use it then to derive a sharp upper bound ipk(T) < \V(T)\/k for an 
arbitrary tree T. 

In order to simplify our consideration, consider that the input tree is 
rooted at a vertex u. By properly rooted subtree we denote a subtree T v , 
induced by a vertex v and its descendants (with respect to u as the root) 
and satisfies the following properties: 

1. T v contains a path on k vertices; 

2. T v \ v does not contain a path on k vertices. 

The algorithm PVCPTree systematicaly searches for a properly rooted tree 
T v , puts v into a solution and removes T v from the input tree T. 

Function PVCPTree(T,k) 

Input: A tree T on n vertices and a positive integer k; 

Output: A A;-path vertex cover S of T; 

Form an arbirary vertex u G T, make T rooted in u; 

S:=0; 

while T contains a properly rooted subtree T v do 

S:=SU{?)}; 

T := T \ T v ; 
return S; 



Theorem 2. Let T be a tree and k be a positive integer. The algorithm 
PVCPTree (T, k) returns an optimal k-path vertex cover of T of size at most 
Therefore, ip k (T) < ^jpi. 
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Proof. First, we shall argue that PVCPTree returns an optimal solution. We 
prove this by induction on the number of vertices in the tree T. If T does 
not contain any path on k vertices, the empty set is the optimal solution. 
Suppose T contains a path on k vertices. Let T v be a properly rooted subtree 
of T. Since any fc-path cover of T contains a vertex of T v it follows that 
ipk{T) — ^k(T \ T v ) + 1 and hence the result follows by induction. 

To prove that the returning set S contains at most ^-jr^ vertices, we 
argue that each loop of the algorithm inserts one vertex into S and removes 
from T one properly rooted subtree having at least k vertices. □ 

Concerning the time complexity of the algorithm, it is straightforward to 
implement the algorithm PVCPTree such that the returning set S in computed 
in linear time. 

4. Outerplanar graphs 

In this section we study the 3-path vertex cover of outerplanar graphs. 
Theorem 3. Let G be an outerplanar graph of order n. Then ipziG) < |. 

Proof. We prove the statement by induction on the number of vertices of G. 

Let if be a maximal outerplanar graph such that G is its subgraph sat- 
isfying V(H) = V(G). (Recall that H is maximal outerplanar if if is outer- 
planar, but adding any edge destroys that property.) It is easy to see that if 
is 2-connected and all its inner faces are triangles. Observe that every A;-path 
vertex cover in if is a fc-path vertex cover in G, since G is a subgraph of if. 
Therefore, it suffices to find a 3-path vertex cover in if of size at most |. 

Obviously, if if consists of a single triangle, then ips(H) = 1 < | = |. 

Assume if has at least four vertices. Since if is a 2-connected outerpla- 
nar graph, the closed trail bounding the outer face contains each vertex of 
if precisely once, hence if is Hamiltonian. Let i>i,i>2, • • • ,v n be the cyclic 
ordering of vertices of H along the Hamiltonian cycle. Colour a vertex Vi 
white, if the degree of in H is 2, otherwise colour it black. Since all the 
inner faces of H are triangles, there are no two consecutive white vertices, 
unless H consists of a single triangle, which is excluded. Hence, the white 
vertices induce an independent set. The edge v ji>i+i is called good, if it has a 
white endvertex, otherwise it is bad. If all the edges incident with the outer 
faces are good, it implies that n is even, and half of the vertices are white. 
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Then the set of all black vertices is a 3-path vertex cover of size | , since the 
white vertices form an independent set. 

Assume there is a bad edge, i.e. there is at least one pair of consec- 
utive black vertices, say Vi,v i+ i. We claim that there is an edge v(Oj of 
H such that v iy v i+ i, and Vj form a triangular face adjacent to the outer 
face, all the three vertices Vi, fj+i, and Vj are black, and all the edges 
Vi+iVi+2,Vi+2Vi+3,- ■ ■ ,Vj-i,Vj (indices taken modulo n) are good. 

For the sake of contradiction, suppose that this is not the case. Let 
e = v jfj+i be a bad edge. There is a unique triangular face of H incident 
both with V{ and t>i+i> let Vj be the third vertex incident with the face. It 
is easy to see that Vj must be black. The vertices v iy v i+ i, and Vj cut the 
Hamiltonian cycle into the edge fjWj+i and two paths; let a(e) be the smaller 
of their lengths. 

Let eo = Vi Vi 0+ i be the bad edge with a(e) minimal; let Vj be the 
corresponding black vertex such that together with Vi and Uj +i they form a 
triangular face of H. Without loss of generality assume that P = v j +i,. • . ,Vj 
is the path of length er(e). Observe that P contains at least one white vertex, 
thus, it contains good edges. If all the edges Vi 0+ iv i +2,- ■ ■ , v j -i v j are g°°d, 
we are done. Otherwise, we find a bad edge e\ = v^v^+i of P. Again, there 
is a unique triangular face of H incident both with v ix and v il+ i, let Vj x be 
the third (black) vertex incident with the face. Since {vi 0+ ±,Vj } is a 2-cut 
of H separating the vertices v io+ 2,- ■ ■ ,Vj -i from the rest of the graph, we 
have Vj t G {fi +i, • • • , Vj }. But then cr(ei) < cr(e ), a contradiction with the 
choice of eo- 

Therefore, there is an edge e = v{Oj of H such that Vi, fj+i, and Vj form a 
triangular face adjacent to the outer face, all the three vertices Vi, Uj+i, and 
Vj are black, and all the edges v i+ iv i+2 ,. . . ,Vj-±Vj are good. It means that on 
the path P = v . . . , Vj black and white vertices alternate, thus, a(e) = 2s 
for some integer s. 

Let W be the set of white vertices on P, let B be the set of black vertices 
on P, including f i+1 and Vj. Obviously \ W\ = s and \B\ = s+1; vertices from 
W induce an independent set in H and v i+ i is adjacent to precisely one of 
them. Let S' be a 3-path vertex cover in the graph H' = H\{vi, v i+ i, . . . , Vj} 
of size at most - = TO ~ 2 2 S ~ 2 given by induction. Then S = S' Li {vi} U (B \ 
{ v i+i}) is a 3-path vertex cover in H of size at most n-2s ~ 2 + 1 + s — |. □ 

The bound | in Theorem [3] is the best possible, since it is easy to find 
outerplanar graphs with ips > ^ . Consider an arbitrary 2-connected outer- 
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planar graph G. Let Vx,v 2 , • • • ,v n be the ordering of its vertices along the 
Hamiltonian cycle (the boundary of the outer face). For each edge fjfj+i 
incident with the outer face (f„+i = i>i), add a new vertex Ui and two new 
edges UiVi and Wjfj+i. Let the resulting graph be H. It is easy to see that H 
is a 2-connected outerplanar graph on In vertices. We claim that ip$(H) > n. 
Let S be a 3-path vertex cover in H. Divide the vertices of H into n pairs 
of the form v^Ui. Observe that if Ui ^ S and t>j ^ S, then 6 S and 
fj-i G S 1 (otherwise there is a path on 3 vertices in H not covered by S). 
Therefore, the average number of vertices in S is either at least ~ (if at least 
one vertex of a pair is in S), or | (if no vertex is in S, then in the previous 
pair both are). Altogether, |,S| > n. 



5. Upper bounds on degree of vertices 

In this section we provide several upper bounds on path vertex cover 
based on degrees of a graph. First, recall theorem of Caro 0] and Wei 24], 
which states 

MG)<\V(G)\- TtVt 

for every graph G. Since the only graphs for which this is best-possible are 
the disjoint unions of cliques, additional structural assumptions allow im- 
provements (see e.g. 11, 12, 13, 14. [l9|). 



The problem of dissociation number of graph was also studied in [10 
where Goring et al. proved 



MG) < \V(G)\ - ^2 YTdlu) ~ £ 



- + JSia) |iV(l,) U Ar (")Kl iV (' 1 '' U N W - !> ' 

In what follows, we provide the following generalization of Caro- Wei the- 
orem for ipk(G). 

Theorem 4. For every graph G: 

MG)<\V(G)\-^J2i 



Proof. Let us arbitrarily order the vertices of G, and let us start with S 
being the empty set. One by one let us add vertex Vi to S unless at least two 
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of its neighbours are already there. The probability that Vi will eventually 
land in S, is 1+ ^ v .^ , because it is the probability that in random ordering of 
vertices of G, Vi precedes each of its neighbours except for one. From this one 
can deduce that the expected size of the set S is at least ^2 Vi( zv(G)_ (i+d(v)) ■ 
Since S is a 1-degenerated graph, S is a forest. Using Theorem [2] we get 
il>k{S) < Finally, to construct a fc-path vertex cover for G we put 

into solution V(G) \ S and all the vertices forming the minimum fc-path 
vertex cover of S. □ 

In the rest of this section, we investigate the upper bounds to the 3-path 
vertex cover based on maximum and average de gree of a graph. First, recall 



a well known decomposition theorem of Lovasz [18 



Theorem 5. If s and t are non-negative integers, and if G is a graph with 
maximum degree at most s + t + 1, then the vertex set of G can be partitioned 
into two sets which induce subgraphs of maximum degree at most s and t, 
respectively. 

The following is a straightforward generalization of Theorem for fc-sets 
(see e.g. @]). 

Theorem 6. For any graph G of maximum degree A, the vertex set of G 
can be partitioned into k sets which induce subgraphs of maximum degree at 
most -r-. Moreover, this can be computed in running time 0(A\E(G)\). 



Corollary 7. Let G be a graph of maximum degree A. Then 

MG)<t£ft\V(G)\. 



Moreover, such a solution can be computed in running time 0(A\E(G)\) . 

Proof. According to Theorem El the vertex set of G can be partitioned into 
|~— lp] sets which induce subgraphs of maximum degree at most 1. Let S 
denote the largest set of such a partition; clearly \S\ > ra^rj ■ To create a 
3-path vertex cover set for G, we protect V(G) \ S. □ 

Corollary 8. There is a linear time algorithm which returns a 3-path vertex 
cover of size at most min( ; 1 ) ; for arbitrary graph G of maximum 
degree at most 3. 
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Proof. To construct a 3-path vertex cover of size at most \E(G)\/2 one can 
repeatedly put into a solution a vertex of degree at least 2. To construct a 
3-path vertex cover of size at most |V(G)|/2 for a graph G, A(G) < 3, one 
can use Corollary [3 □ 

Next we provide an algorithm which guarantees a tight upper-bound for 
ipz based on the number of vertices and edges of a graph. 

Algorithm 1: SPARSE 3 (G) 

Input: A graph Gonn vertices; 

Output: 3-path vertex cover H, H C V(G); 

tf:=0; 

while G contains any vertex v of degree at least 4 do 
H :=HU{v}; 

Remove from G the vertex v and all edges incident with v; 
H := H U Solve_SubCubic(G); 
return H; 

Here, Solve_SubCubic(G) denotes the algorithm of Corollary [HI 

Theorem 9. Let G be a graph having n vertices and m edges. The algorithm 
SPARSE 3 (G) returns a 3-path vertex cover H of size at most 2n + m . 

Proof. The algorithm consists of two phases. The first phase repeatedly 
erases vertices of degree at least 4. After this step, we get a graph G of 
maximum degree 3. In the second phase the algorithms uses the method 
Solve_SubCubic(G). 

Let s denote the number of vertices (of degree at least 4) protected in 
the first phase, let d denote the number of vertices protected in the second 
phase. From Corollary [S] we get s + 2d < n and 4s + 2d < m. Therefore, we 
get two upper bounds for \H\. 

\H\ = s + d < s + ^ = f + f 
\H\ = s + d < s + ^ = f -8 

This implies 3 1 H\ < n + f. □ 
Corollary 10. Let G be a graph on n vertices and m edges. Then 

< — - — . 

o 
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Theorem 11. Let a,b are integers such that b < a < 2b. There is a graph 
on n vertices and m edges such that ^ = § and 

MG) > — - — • 

Proof. Let x = 3(26 — a) and y = 2 (a — b). We construct the graph G having 
two types of components: 

• x components C4 (a cycle on 4 vertices) with 4 edges, ^3(^4) = 2- 

• y components H 6 (K 6 with a perfect matching removed) with 12 edges, 
^(H 6 ) = 4. 

It is easy to see that n = 4x + 6y and m = 4x + 12y. Then 

2n + m 12x + 2Ay 

— g = g = 2x + 4y = V 3 (G). 

Moreover, 

m 2a; + 6y 6(26 - a) + 12(a - 6) 6a a 



n 2x + 3y 6(26 - a) + 6(a - 6) 66 



□ 
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